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PROTEIN TYROSINE KINASE AND L I GANDS THEREOF 

The present invention relates to a putative signalling 
5 molecule. More particularly the invention relates to a novel 
tyrosine kinase and the use of this tyrosine kinase in the 
development of diagnostic and therapeutic approaches to 
cancer, for example breast cancer. 

i 

10 Protein tyrosine kinases are enzymes which show the 

property of catalysing the transfer of phosphate groups from 
donor molecules (ATP) to the hydroxy 1 groups of tyrosine 
residues in polypeptides. Known tyrosine kinases can be 
classified into two broad groups. Transmembrane tyrosine 

15 kinases traverse cellular membranes so that they possess 
extracellular and intracellular domains ♦ cytoplasmic tyrosine 
kinases are located only intracellular ly. A general feature 
of transmembrane (also referred to as receptor) tyrosine 
kinases is that they possess extracellular ligand-binding 

20 domains, hydrophobic transmembrane sequences, and 
intracellular portions which include the tyrosine kinase 
domains (for a review see Ullrich & Schlessinger Cell, .61, 
203-212 (1990)). 

25 Tyrosine kinases can induce cell proliferation, cell 

transformation and regulate developmental events (see reviews 
by Hanks et al, Science, 241 , 42-75 (1988) and Cant ley et al, 
Cell 64* 281-302 (1991)). In general, where functional assays 
have been available (e.g. mitogenesis or transformation), it 

30 has been shown that the biological functions of tyrosine 
kinases are usually dependent on intact enzyme activity and 
that through autophosphorylation and the phosphorylation of 
other proteins, they alter the subcellular localisation and 
activities of various components of the intracellular 

35 signalling pathways. 

Analysis of the oncogenes of many acutely transforming 
animal retroviruses has revealed that their products 
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15 



frequently manifest tyrosine kinase activity, as do the 
products of their cellular proto-oncogene counterparts, other 
genes encoding tyrosine kinases have been found to be altered 
by DNA rearrangements in cancer cells, the result being a 
presumed acquisition of cellular transforming activity (for 
example c-abl, c-met) . some cellular proto-oncogenes encoding 
tyrosine kinases have been cloned independently by virtue of 
the fact that they encode growth factor receptors, for example 
the epidermal growth factor receptor. On the other hand, the 
genes for other growth factor receptors which have tyrosine 
kxnase_ activities ,_such as those for platelet-derived growth 
factor and ' insulin-like growth factors, have been well 
characterised, but, have never been found to be transduced by 
retroviruses . 



The c-erl,B-2/HER2/c-neu. gene , encodes a transmembrane 
receptor-like tyrosine kinase which is structurally very 
similar to the EGF/TGF alpha receptor. The c-erbB-2 gene has 
been found to be overexpressed in 20 to 30% of human breast 
tumours, often in association with gene amplification, and 
this phenotype is now generally accepted as predictive of poor- 
disease free and overall survival (for reviews see Sunderland 
& McGuire in Regulatory Mechanisms in Breast Cancer, Lippman 
& Dickson (Eds.), Kluwer Academic Publishers, Boston, pages 
25 3 to 22 (1991) and Gusterson et al., j. Clin. Oncol., io 
1049-1056 (1992)). Overexpression of the gene in fibroblasts 
induces transformation (di Fiore et ai, Science, 232, 178-182 
(1987) and Hudziak et al, Proc. Natl. Acad. Sci. (USA) , M 
7159-7162 (1987) ) , and ligands which bind to the c-erbB-2 gene 
product (human and rodent, and activate its tyrosine kinase 
activity have been identified (Lupu etal, Science, 249 
1552-1555 (1990), Dobashi et al, Proc. Natl. Acad, Sci. (USA) ' 
tt, 8582-8586 (1991), Wen et al, Cell, 559-572 (1992, and 

Holmes et al, Science 25J>, 1205-1210 (1992)). some of these 
ligands increase the rate of proliferation of cells expressing 
the c-e^bB-2 protein. The strong implication of these and 
other findings is that overexpression of this tyrosine kinase 
m some breast tumour cells is one of the important steps in 
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their progression towards tumourigenicity , and therefore that 
c-erbB-2 can function as an oncogene. 

A variety of other receptor tyrosine kinases, including 
5 those for the IGFs and the FGFs, are expressed, in breast 
tumours (Stewart et al, J. Biol.' Chem. , 265 21172-21178* (1990) 
and Wellstein & Lippman in Molecular 1 Foundations of Oncology, 
Broder (Ed.)/ Williams and Wilkins, Baltimore, pages 403-418 
(1991)), and the EGF/TGF alpha receptor is overexpressed in 

10 some cases (for example Horak et al, Oncogene, j>, 2277-2284 
(1991)). As with c-erbB-2, overexpression of EGFr has been 
correlated with poor prognosis. Since the ligands of some of 
these receptors can be detected in breast tumour samples, it 
may be that breast tumour development is also regulated by 

15 these molecules (for a review see van de Vijver & Nusse, 
Biochim, Biophys. Acta, 1072 . 35-50 (1991)). 

The features which are ideally required in a molecular 
target for tumour therapies include preferential expression 
20 in the tumours, a role in the development of the tumours 

(rather than merely being a marker) , and a knowledge of the 

i 

mechanism of its action as a starting point for the rational 
design of activity modulators. It appears that all of these 
three criteria are satisfied, at least to some extent, in the 
25 case of c -erb B-2 and breast cancer and a considerable research 
effort is currently being directed to this molecule. 

It has recently been found that the alkaloid K252a, at 
appropriate concentrations, selectively inhibits the tyrosine 

30 kinase and biological activities of the trk-class of 
neurotrophin receptors, but has no effect on the activities 
of v-src, v- fros or the receptors for EGF and PDGF, and also 
does not affect general cell viability or proliferation 
(Tapley et al, Oncogene, 7, 371-381 (1992)). Members of 

35 another class of molecules, the tyrphostins, may be selective 
inhibitors of the EGFr/c- erb B-2 class of tyrosine kinases 
(Gazit et al, J. Med. Chem., 34# 1896-1907 (1991)). An 
antibody which inhibits the proliferation of human tumour 
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cells by binding to the c-erhR-5 • 

i i-i = +- • erfeB-2 protean is currently under 

ev.luat.on as a potential cancer therapy (Carter et al , Pr oc 
Natl. Acad. SCI. (USA) , 89, 4285-4289 (1992) ). 

5 If the transformation of a significant minority of breast 

a~ty c of xs t h ls due ^ at ieast in part - to «» 

that other * ***** «"» " S «"* 

oteL . tUm ° UrS SSPeCts ° f their transformed 

alt ".I" fUrther tUn ° Ur cosine Kinases 

would allow the development of novel dia 9 nostic and 

tlours aPPr ° aCheS " Can « r - *» >reast 
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20 



25 



30 



35 



tyrost! * P " Sent inventi °" " *° P-vide a novel 

tyrosme kinase and its use in the diagnosis and/or therapy 
of cancer, particularly human breast tumours. ' 

* PreSent Prides a DMA isolate encoding a 

novel protean tyrosine kinase or a fragment thereof, L 

se!! T" ne kiMSe bein9 *■*«*«*— "y the amino acid 
sequence shown in 

SEQ ID NO. 2 

or by an amino acid sequence showing a significant degree of 
lZllT ic T ret °' Pr6ferably at leaSt "* oology in the 
of T T*™ ^ l6aSt 4 ° % h0m010 ^ in th * remainder 

overall 7 ^ Prefe " bl * -t least 90% homology 

overall, for example at least 95% homology. 

According to one embodiment, the DNA isolate encodes the 
fuH amxno acid sequence of the novel protein tyrosine kinase 
or an amxno acid sequence showing a significant degree of 
homology thereto. According to another embodiment the DNA 
isolate encodes a fragment of the amino acid sequence of the 
protexn tyrosine kinase or an amino acid sequence showing a 
sxgnxfxcant degree of homology thereto. DNA sequence encoding 
fragments of the protein tyrosine kinase preferably encodes 
those parts of the amino acid sequence which characterise the 
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enzyme, i.e. those parts which are most distinct from other 
protein tyrosine kinases. 

The DNA sequence and deduced amino acid sequence of the 
5 novel protein tyrosine kinase are shown in SEQ ID NO. 1. Most 
preferably the DNA sequence encodes all or part of the 
catalytic domain of the enzyme. The catalytic domain is 
encoded by nucleotides 617-13.24 of SEQ ID NO. 1. - 

i 

10 The DNA isolate may have the base sequence defined in SEQ 

~ XD — N °-« 1 — encode -the — reievant — amino — acid— sequence. 
Alternatively the .DNA isolate may have any other DNA sequence 
encoding the relevant amino acid sequence. 

15 According to another embodiment, the invention provides 

a DNA isolate encoding amino acids l to 77 of SEQ ID' NO. 2 
followed by the amino acids of SEQ ID NO. 12. This coding 
sequence of this isolate may have the base sequence of 
nucleotides 29 to 552 inclusive of SEQ ID NO. 1, but excluding 

20 nucleotides 25? to 380 inclusive, or any other base sequence 
encoding the relevant amino acid sequence. 

The present invention also provides a DNA isolate in the 
form of a cloning vector or an expression vector preferably 
25 a plasmid vector, including DNA as defined above. In the case 
of an expression vector the DNA will be under control of an 
appropriate promoter and will include regulatory elements 
required for expression in a suitable host cell. 

30 The present invention also includes a recombinant cell 

line transformed with an expression vector as defined above 
and capable of expressing a recombinant protein tyrosine 
kinase or a fragment thereof characterised by the amino acid 
sequence shown in 

35 SEQ ID NO. 2 

or an amino acid sequence showing a significant degree of 
homology thereto. 
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Suitable host cells include .mammalian cells, insect 
cells, yeast cells and bacterial cells and the expression 
vector will include a promoter and other regulatory elements 
appropriate to the host cell in question. Preferred host 
5 cells include CHO cells, myeloma cells, immortalised human 
breast cells, rodent fibroblast cell lines, baculovirus cells, 
yeast and >E. coli cells. General techniques for manipulating 
DNA coding sequences and expressing, such sequences in various 
types of cells are well known to those skilled in the art and 
10 are described for example in Sambrook et al , Molecular 
Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor 
Laboratory. ' 

The present invention also provides a protein tyrosine 
15 kinase or a fragment thereof characterised v by the amino acid 
sequence shown in 
SEQ ID NO. 2 

or an amino acid sequence showing a significant degree of 
homology thereto*. 

20 

The protein tyrosine kinase or' a fragment thereof may be 
a recombinant polypeptide produced as the expression product 
of a coding sequence as defined above in a recombinant cell 
line. Alternatively the protein tyrosine kinase may be 

25 produced in a human tumour cell line. In either case the 
protein may be extracted and purified by standard techniques, 
for example antibody affinity chromatography. The 
glycosylation of the protein (if any) will depend on the cells 
in which it is produced and protein produced in human tumour 

30 cells in culture should have glycosylation equivalent to that 
produced in vivo in human tumours. 

The DNA sequence set out in 
SEQ ID NO. 1 

35 represents DNA sequence and deduced amino acid sequence from 
a novel tumour protein tyrosine kinases. 

The DNA sequences encoding the novel protein tyrosine 
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kinase has been isolated in the manner described in more 
detail in the example set out below but which involves briefly 
the following steps: 

i) isolation of mRNA from metastatic tumour tissue; 
5 ii) preparation of cDNA from the isolated mRNA; 

iii) PCR amplification using degenerate oligonucleotide 
primers designed to amplify sequence associated with 
protein tyrosine kinases; 
iv) subcloning PCR products; 
10 v) identification of protein tyrosine kinase products 
amplified, at higher leve-ls^from-tumour bearing tissue; 
vi) ONA sequencing and identification of novel protein 
tyrosine kinase sequence; 

vii) isolation of the putatively full length cDNA from a cDNA 
15 library. 

Application of the above protocol identified the above 
novel protein tyrosine kinase. The enzyme, which is believed 
to be a cytoplasmic tyrosine kinase, has been found to be 
20 overexpressed in certain human breast tumours as compared to 
normal breast tissue but has not been detected at all in other 
types of human tissue. 1 



25 



30 



The present invention also relates to the application of 
the novel tyrosine kinase for the development of therapeutic, 
prognostic and diagnostic approaches to cancer. The invention 
is particularly applicable to breast cancer, however, the same 
approaches may also be applicable to other cancers and 
research data suggests a connection between breast cancer and 
ovarian cancer and also between these two cancers and gastric 
cancer . 

In terms of therapy, the involvement of the novel 
tyrosine kinase in tumours means that beneficial clinical 
effects in the treatment of tumours can be obtained by 
modulating the tyrosine kinase activity and/or the 
functionality of the signalling molecule. There are a number 
of ways in which such modulation could be achieved. 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO_9502057A1J_> 



35 



WO 95/02057 



PCT/GB94/01479 



8 

The activity of the protein tyrosine kinase could be 
significantly impaired or inhibited by small chemical 
molecules and screens to identify suitable small molecular 
weight inhibitors can be developed in more detail below. This 
5 approach is analogous to that proposed for other tyrosine 
kinases (see "Drugs of the Future", 17f2) . 119-131, (1992)). 
A further embodiment of this approach would be to derive nmr 
spectra or crystal structures for the protein tyrosine kinases 
or domains thereof and use the structural information so 
10 obtained to synthesise chemical structures de novo which could 
be similarly screened for activity as inhibitors. 

The approach described above could be extended to 
encompass small peptides which are either competitive for 

15 signalling action of the tyrosine kinase, or which demonstrate 
useful binding thereto, such that its function is inhibited. 
This would include, for example, peptides which are capable 
of blocking substrate binding to the signalling molecule, 
including but not limited to those which bear SH2 domains. 

20 An additional approach is to use small molecules or peptides 
to block or interfere with ligand activation or dimerisation 
of the tyrosine kinase. Both of these events are generally 
required to effect a biological signal so that blocking or 
inhibition thereof could be used to therapeutic effect. 

25 

Protein tyrosine kinase function could also be blocked 
by expressing or administering a peptide which is a truncated 
or altered version of the protein. These are typically 
referred to as dominant negative proteins and are believed to 
30 sequester the active protein in a non-functional complex. 

Protein tyrosine kinase function could also be modulated by 
blocking translation of the mRNA encoding the protein using 
antisense oligonucleotides. 

35 A further regulatory role of the protein tyrosine kinase 

could be manipulated by controlling expression or activity of 
specific phosphatases which control the degree of protein 
phosphorylation. It is recognised that the phosphorylation 
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of specific residues, particularly tyrosine, threonine and 
serine residues, play an important role in transmitting a 
biological signal from the tyrosine kinase. 

Use of the above therapeutic approaches depends on the 
identification and/or development of suitable agents for 
modulating, for example inhibiting, 1 particular functions or 
activities of the protein tyrosine kinase. The- present 
invention also relates to assays or screens by which such 
agents can be identified. * 

The present invention also provides a method of screening 
a substance for potential utility as a therapeutic agent in 
the treatment of cancer, in particular breast cancer, which 
comprises providing a standard system in which a protein 
tyrosine kinase as defined above or an active fragment thereof 
is able to develop a measurable effect/ allowing the protein 
tyrosine kinase to develop that effect in the presence and 
absence of the said substance and measuring that effect , 
ability to produce significant inhibition of the effect being 
taken as an indication of potential utility as a therapeutic 
agent . 1 

In accordance with this general principle, screens for 
25 potential therapeutic agents can take a number of forms. For 
example an initial screen to determine whether or not a 
substance merits further investigation as a potential 
inhibitor of the protein tyrosine kinase will usually be 
biochemical and should preferably be simple, rapid and capable 
30 of high through-put. Such screens will often make use of the 
protein expressed in a recombinant or derived from a cellular 
source in which the protein is over expressed. Particularly 
suitable recombinant expression systems include insect 
baculovirus, and the use of immortalised human breast cell 
35 lines or rodent fibroblast cell lines as host cells. The 
protein is then used directly in an in vitro assay with and 
without the potential inhibitor. This gives rapid data on the 
performance of the substance as an inhibitor often providing 
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direct evidence that the substance can i„h ibi t the enzymic 
activity of the protein tyrosine kinase and thus merits 
further study. This type of isolated system can also provide 
data on specificity but will not generally provide any 
information on bioavailability. The enzyme activity of the 
protein tyrosine kinase in vitro can be measured, for example, 
by^measunng autophosphorylation or phosphorylation of a model 
substrate, by following incorporation of radioactive phosphate 
or by using anti-phosphotyrosine antibodies. 

Further information concerning potential inhibitors can 
be provided by cell based screens which make use of a 
Phenotypic alteration, e.g. a change in morphology and/or 
tumorigenicity, conferred by expression of' the protein 
tyrosine kinase in a recombinant cell line or over-expression 
in any other available cell line. Particularly suitable 
recombinant cell lines again use immortalised human breast 
cell lines or rodent fibroblast cell lines as host cells. The 
cells can be used in proliferation or tumorigenicity assays 
with and without the potential inhibitor looking for 
substances able to halt growth of and/or morphologically de- 
transform the cells. a control for substances which are 
generally cytotoxic can be provided by the same parental cell 
line engineered to express another oncogene which transforms 
the cells at a signalling point downstream of the protein 
tyrosine kinase. Assays of this sort can be very informative 
and they may provide data on mode of action of inhibitors. 

The final stage of screening is the development of animal 
screens. These are time consuming and expensive and for these 
and ethical reasons their use is kept to a minimum, however 
they can give vital information regarding metabolism, 
clearance and performance of a substance in a true in vivo 
system which cannot be obtained in other ways. Animal screens 
ideally use the same recombinant or other cell lines as 
described above which express the protein tyrosine kinase. 
The cells are used as xenografts in animals, for example, nude 
mice, to give tumours whose growth is controlled by expression 
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of the protein tyrosine kinase. The mice are then given 
inhibitors to determine whether they are capable of causing 
the selective regression of the xenografts without blocking 
the growth of a control xenograft. Alternatively transgenic 
5 animals which develop tumours by virtue of the appropriate 
tissue specific expression of the target protein tyrosine 
kinase can be used in a similar manner. 

The invention also extends to therapeutic agents 
10 identified by use of any or all of the screens referred to 
above. Preferably the therapeutic agent is a chemical molecule 
of relatively low molecular weight, for example, less than 
about 1000. Examples of suitable classes of molecule include 
staurosporine analogues, tyrphostins and flavenoids. The 
15 therapeutic agent can also be peptide or an antisense mRNA 
encoding the protein tyrosine kinase. 1 

The invention also extends to an enzyme-substrate complex 
which comprises a protein tyrosine kinase characterised by the 
20 amino acid sequence shown in 

SEQ ID NO. 2 ' 
or by an amino acid sequence showing a significant degree of 
homology thereto and a therapeutic agent capable of modulating 
the activity of the said protein tyrosine kinase. 

25 

The novel protein tyrosine kinase can also be used in 
prognostic and diagnostic applications. Thus antibodies 
raised against the protein tyrosine kinase or nucleic acid 
encoding the novel tyrosine kinase can be used as the basis 
30 for screening tissue, in particular tumour tissue, for the 
presence of the novel protein tyrosine kinases. If, as in the 
case of c-erbB-2, the expression of the tyrosine kinase 
correlates with poor prognosis, then treatment of the patient 
could be modified accordingly. 

35 

Accordingly the present invention provides a method for 
detecting a tyrosine kinase having an amino acid sequence as 
defined in 
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SEQ ID NO. 2 

or a fragment thereof which comprises reacting a test sample 
with a specific antibody raised against an antigen fro, the 

"tLr n0 , aC K ld , SeqU6nCe d6teraini ^ tether there is any 
antigen-antibody binding within the test sample The test 
sample may be for example a tissue sample, such as a tumour 
sample and presence of abnormal , amounts of the protein 
tyrosxne kinase may indicate development of or susceptibility 
to develop a tumour. 

The invention also provides a method for the detection 
xn a sample of a nucleic acid sequence encoding a tyrosine 
kinase which comprises subjecting the sample or nucleic acid 
isolated therefrom to a method capable of detecting a nucleic 
acid sequence therein identical or substantially homologous 
to a defined nucleic acid sequence and identifying any 
sequence identical or substantially homologous to all or part 
or the DNA sequence defined in 
SEQ ID NO. 1, 

llar*T &nC * an ° ther Part(?) ° f the cosine kinase 

characterised by that sequence or mRNA which would be the 
transcription product of such DNA. Again presence of abnormal 
amounts of nucleic acid encoding the protein tyrosine kinase 
»ay indicate development of or susceptibility to develop a 
tumour • 



Suitable methods for the detection of specific DNA 
sequences include Southern blotting and/or pgr using 
appropriate primers. Suitable methods for detecting mRNA 
include Northern blotting, RNAse protection studies and direct 
nucleic acid in situ hybridisation. 

Experimental work on which the invention is based is 
described in more detail in the following Example in which 
reference is made to the following figures in which: 

Figure 1 shows expression of brk transcripts in human tissues 
and breast tumour cell lines; 
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Figure 2 shows a schematic representation of the human brk 
cDNA; 

Figure 3 shows baculovirus and bacterial protein expression 
of brk ; 

Figure 4 shows detection of brk mRNA expression in normal 
breast and grade III breast carcinoma tissue by* RNase 
protection; 1 

Figure 5 shows detection of brk mRNA expression in normal 
breast and grade ill breast carcinoma tissue by PCR. 
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EXAMPLE 



Materia ls and Methods 
5 Tissue samples 

Human lymph node and tumour samples were obtained from 
surgery and rapidly frozen in liquid nitrogen. Tumour grading 
was in accordance with accepted national criteria (Pathology 
Reporting in Breast Cancer Screening, Draft Guidance. 
Prepared by a Department of Health and Royal College of 
Pathologists Working Group (1989)) Breast Screening 
Publication ISBN 187997101. Normal breast tissue was obtained 
from reduction mammoplasty and rapidly frozen in liquid 
nitrogen. Frozen tissue samples were stored either in liquid 
nitrogen or at -70 °C. 



Cell lines 



Breast tumour cell lines were obtained from the American 
Type Culture Collection. Cell lines were maintained under 
conditions recommended by the supplier. 

RNA purification and Northern hybridisation 



Total RNA was purified from tissue samples using the 
method of Chomczynski and Sacchi (Anal. Biochem. , ±£2_, 156-159 
(1987) ) . Cytoplasmic RNA was extracted from cell lines by the 
method of Wilkinson (Nuc. Acids. Res., 16, 10933 (1988)). 

30 Poly a* mRNA was purified using oligo dT spun columns 
(Pharmacia) according to manufacturer's instructions. RNA 
samples were electrophoresed through forma ldehyde-agarose (1%) 
gels and transferred to Hybond N membranes (Amersham) . The 
human tissue poly A* Northern blot was purchased from 

35 Clontech. 32 P radiolabeled probes were generated as 
previously described (Feinberg and Vogelstein, Anal. Biochem., 
132, 6-13 (1983)) and hybridised to membranes according to 
manufacturer's instructions. Membranes were washed at 65 "C 
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at a final stringency of 0.1 x SSC, 0.1% SDS. 

Reverse transcriptase-polymerase chain reaction (RT-PCR) , and 
PTK differential screen 

5 

For the production of single stranded cDNA, 5 /ig of RNA 
were digested with 7.5 units (u) RNase-free DNase I in 10 ^1 
for 10 mins. at 37 °C in the presence of 40u human placental 
RNase inhibitor. After extraction with phenol/ chloroform and 

10 chloroform, the RNA was precipitated, resuspended in 10 pi 
-^wat er , ^denatuxed~^t b5 6 C~~f or 5 "loins . , and stored on ice. 
Reverse transcription reactions (10 pi) contained 2.5 pg RNA, 
40 ng pdN6 (Pharmacia) , lmM dNTPs, 40u RNase inhibitor, 30u 
AMV reverse transcriptase XL (Life Sciences Inc.) and reverse 

15 transcriptase buff er produced by the enzyme supplier. Control 
reactions contained no 'reverse transcriptase. Reaction^ were 
carried out at 41 °C for 1 hour. 

Degenerate deoxyoligonucleotides were synthesised to 
20 amplify PTK encoding cDNA sequences. 
SEQ ID NO. 3 (sense) 
SEQ ID NO. 4 (antisense) 
These primers are based on the following amino acid sequences 
SEQ ID NO. 5 
25 SEQ ID NO. 6 

respectively. 

Amplifications were carried out in 50 pi volumes 
containing 2 pi of single stranded cDNA, 60 mM KC1, 15 mM 

30 Tris-HCl (pH 8.8), 1.1 mM MgCl 2 , 200 pVL of each dNTP, 20 pmol 
of each primer and 2u of Taq polymerase (Cetus) . The PCR 
temperature cycle parameters were: 94 °C for 1 minute (min) , 
50°C for 2 mins, 72 °C for 1 min, 30 cycles. For each RNA 
sample, control amplifications were performed on samples which 

35 had not subjected to reverse transcription (RT-controls) . 

The amplified PTK PCR products were electrophoresed 
through 3% agarose gels and purified using "Mermaid" reagents 
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(Bio 101 inc) . PCR products were subcloned into the vector 
PCR II (invitrogen) according to supplier's instructions. 
Bacterial cell lysates from the involved node PTK fragment 
library were spotted on to duplicate Hybond N membranes. PTK 
PCR products amplified at higher levels from tumour bearing 
lymph nodes with respect to non-involved nodes were identified 
by differential DNA hybridisation, using the protocol of 
Crompton et al, Nacl. Acids Res. , 20, 4107-4108 (1992). 



RT-PCR expression studies were carried out using specific 
primers based on the brk Xt2 cDNA sequence (sense: 485-502, 
antisense: 1008-989) and the human glycer aldehyde 3 -phosphate 
dehydrogenase (GAPDH) cDNA sequence (Arcari et al., Nacl. 
Acids Res., 12, 9179-9189 (1984)) (sense: 71-96, antisense 
15 1030-1053). Reverse transcription and PCR We carried out 
as described above, with all four primers in the same 
reaction. RT-controls were performed for each sample. After 
electrophoresis through 3% agarose gels, amplified GAPDH 
products were detected by ethidium bromide staining. 
Amplified brk products were detected by Southern transfer to 
Hybond-N membranes (Amersham) and probing with radiolabeled 
Xt2 cDNA sequences (Sambrook et al., Molecular Cloning:' A 
Laboratory Manual, Cold Spring Harbor Laboratory Press 
(1989)) . 



RACE PCR 



30 



CDNA sequences 3' of the region amplified by the PTK PCR 
were isolated by a modification of the RACE protocols 
described by Frohman et al. (Proc. Natl. Acad. Sci. (USA), 85, 
8998-9002 (1988)). T-47D RNA was reverse transcribed using 
a 'tagged' oligo dT primer as shown in 
SEQ ID NO. 7. 

Two specific sense oligodeoxynucleotides were used for the brk 
35 kinase sequence in consecutive PCR reactions. These were as 
shown in 

SEQ ID NO. 8 

SEQ ID NO. 9. 
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The reaction buffer was the same as described above except the 
MgCl 2 concentration was 1.5 mM. The PCR cycle parameters 
consisted of a single denaturation step at 92 °C for 2 mins, 
followed by 30 cycles at 92°C for 1.25 mins, 56°C for 2 mins 
5 and 72 °C for 2 mins. 

5' RACE PCR was carried out using the Life Technologies 
5 ' RACE system according to manufacturer's instructions. The 
two nested brk specific antisense oligodeoxynucleotides were 
10 as shown in 

SEQ ID NO. 10 
SEQ ID NO. 11. 

The PCR cycle parameters consisted of a single denaturation 
step at 92 °C for 2 mins followed by 30 cycles at 92 °C for 1 
15 min, 55°C for 1.5 mins and 72°C for 1.5 mins. 

Constructions and screening of cDNA libraries 

Cell line cDNA libraries were prepared in the vector 
20 lambda 2APII (Stratagene) using reverse transcriptase 
(Superscript II, Life Technologies) and a • uniZAP-cDNA 
synthesis kit (Stratagene) . The complexities of the T-47D and 
MCF-7 libraries were 2.1xl0 6 and 5xl0 6 . Libraries were 
screened using Hybond N nylon membranes (Amersham) . 32 P 
25 radiolabelled DNA probes were hybridised to the membranes 
according to manufacturer's instructions. Filters were washed 
in 2 x SSC, 0.1% SDS at 65°C and finally in 0.1 x SSC, 0.1% 
SDS at 65 °C. 

30 DNA sequencing 

cDNA inserts were sequenced by the dideoxy method (Sanger 
et al., 1977) using the Sequenase version 2 sequencing kit 
(United States Biochemicals) after generating exonuclease III 
35 nested deletions in the phagemids SK" and KS* (Stratagene) and 
rescuing single stranded templates as described by the 
supplier. 
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Baculoviral expression of brk 

The br£ cDNA Xt2 was subcioned into the baculovirus 
transfer vector P VL1392 (Invitrogen) . This was co-transf ected 
with BaculoGold (Pharmingen) viral DNA onto s D odo P t»™ 
f ruqiperda (Sf2l) insect cells and three days later the ( medium 
was used in an agarose plaque assay. Recombinant plaques 
containing brjc were isolated, characterised, and" used to 
generate high titr,e viral stocks (>3 x 10 s pfu/ml) . Monolayer 
cultures of Sf-21, cells were , subsequently infected at a 
multiplicity of infection of 5 pfu/cell and analysed after 48 
hrs. Insect cell lysates were prepared as previously 
described (Page et al . , J. Biol. Chem. , 264., 19147-19154 
(1989)). 

) ■ "" ' ' 

Bacterial expression of brk and immunodetection 

j 

A Smal-HindHI fragment from clone Xt2, encoding amino 
acids 3-451, was cloned into the bacterial expression plasmid 
PGEX-4T-3 (Pharmacia). Bacterial cultures expressing this 
Plasmid were grown at 37°C in L-brbth containing 100 /xg/ml 
ampicillin and induced with 0.1 mM isopropyl-B4 D - 
thiogalactopyranoside for 2 hours. Induced bacteria were 
lysed by boiling in SDS-PAGE sample buffer. 

Cell lysates were separated on 10% SDS-polyacrylamide 
gels and transferred to nitrocellulose membranes. Membranes 
were blocked by incubation in phosphate-buffered saline (PBS) 
containing 5% (w/v) non-fat dried milk-protein, washed with 
PBS/0.05% Triton X100, and incubated with the anti- 
phosphotyrosine monoclonal antibody 4G10 (UBI) (2 ng/ml) in 
LIS medium (Gibco BRL) containing 10% (v/v) FCS. The antibody 
was detected using the ECL detection reagents (Amersham) as 
according to manufacturer's instructions. 
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Construction of expression vectors for expression of brk in 
mammalian cells 

The brk cDNA Xt2 was isolated from the cDNA library as 
a subclone between the EcoRI and Xhol sites of pBluescript SK 
(-) (Strata-gene) . The cDNA was excised by digestion witlh Kpnl 
and BamHI, its ends were blunted, and it was subcloned into 
the Smal site of pBluescript KS (+) (Stratagene) . "Ligation 
in one orientation produced a construct whereby digestion with 
Hindlll produced a fragment carrying the full brk coding 
sequence. This Hindlll fragment was ligated into either 
pRcCMV (Invitrogen) or pREP8 (Invitrogen) , and in each case 
clones with the 'sense' orientation were selected. 

Stable transfection of human mammary epithelial cells 

The mammary epithelial cell line HB4a has shown itself 
to be resistant to transfection and a modified calcium 
phosphate transfection technique was used specifically for 
these cells to ensure successful 1 stable expression of the 
vector. This was achieved using an episomal vector to provide 
stable expression of the brk cDNA and uniquely demonstrate a 
transformed phenotype. Two parameters proved critical to 
achieving increased transfection efficiency: 

1. Use of OPTIMEM, an a -MEM based medium (Gibco) , for 
seeding and transfection. A special feature of this medium 
is reduced phenol red (a pH indicator dye which is present in 
all media) and it is believed that the dye may interfere with 
transfection of HB4a cells. 

2. A specific coarseness of calcium phosphate-DNA 
complex achieved by mixing DNA and calcium chloride with 
immediate vortexing for a duration of six seconds. The cells 
were then exposed to the precipitate overnight, shocked with 
15% glycerol for exactly 2.5 min and then returned to standard 
growth medium. 
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RKase protection 

A Pst I fragment of the Xt2 brk cDNA (bases 302-607) was 
subcloned into the vector pBluescript (Stratagene) . The pTRI- 

5 GAPDH vector, containing a 316 bp human glyceraldehyde 3- 
phosphate dehydrogenase cDNA fragment, was obtained from 
Ambion Inc. Radiolabeled antisense RNA probes were 
transcribed from linearised vectors using standard procedures 
(Sambrook et al., Molecular Cloning: A Laboratory Manual, 

0 Cold Spring Harbor Laboratory Press (1989)). . Probe 
purification and RNase protection were carried out using 
protocols and reagents in the 'Ambion RPA II' kit. Protected 
fragments were resolved on DNA sequencing polyacrylamide/urea 
gels, and detected in dried gels by autoradiography. 

5 

RESULTS 



20 



25 



Isolation of ptk cDNAs differentially expressed in a breast 
tumour metastasis 



RNA was isolated from involved and non-involved axillary 
nodes removed from a patient with metastatic breast carcinoma. 
The RNA was subjected to reverse transcription-polymerase 
chain reaction (RT-PCR) using degenerate oligonucleotides 
corresponding to the highly conserved amino acid sequences 
present in subdomains VI (SEQ ID NO. 5) and IX (SEQ ID NO. 6) 
of the PTK catalytic domain (Hanks et al., Science, 2AX, 42-52 
(1988)). The pool of PTK cDNA fragments amplified from 
involved node RNA was subcloned to create a plasmid library 
30 of PTK cDNAs. Radiolabeled probes prepared from the two 
pools of PTK fragments amplified from the involved and non- 
involved node RNAs were used to screen duplicate platings of 
the PCR fragment library. Of 50 duplicate clones screened, 
4 were found to hybridise preferentially to the probe prepared 
35 from the PTK cDNA fragment pool amplified from the involved 
node RNA. Sequencing of these clones identified three 
different cDNA fragments whose deduced amino acid sequences 
inferred that they encoded PTKs. One of the cDNA fragments 
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(designated PTK23) encoded an amino acid sequence showing 
homology to members of the src PTK family blk., c-src and c- 
yes. Further characterisation of this novel putative PTK 
named brjc (breast tumour kinase) is described herein. 

Isolation of brk cDNAs 

Initially, a 600 base pair (bp) • brk cDNA fragment was 
generated by 3' RACE (rapid amplification of cDNA ends) PCR, 
using two specific nested oligonucleotides corresponding to 
the brk sequence between the two degenerate primers. This 3' 
RACE fragment was then used in expression studies to identify 
human breast tumour cell lines and normal ■ tissues that 
expressed the brk transcript... Northern analysis using the 600 
bp fragment on total RNA isolated from a range of human breast 
tumour cell lines (MCF^7, T-47D, MDA MB 468, -415, -157 and 
BT474, data not shown) did not detect any brk transcripts. 
Using the same fragment as a probe on Northern blots 
containing poly A* selected RNA, two transcripts in the size 
20 range of 2.3 kb and 2.6-2.8 kb were detected in the two human 
breast tumour cell lines tested (T-47D and MCF-7) . 

Figure 1 shows expression of brk. transcripts in human 
tissues and breast tumour cell lines. Northern blots, 

25 containing 2 ng of poly A* RNA isolated from human tissues and 
two human breast tumour cell lines, were hybridised with a 
human brJs probe (upper panel) and 6APDH probe (lower panel) . 
The blots subjected to autoradiography for 10 days ( brk ) and 
36 hours (GAPDH) . The mobilities of the molecular weight 

30 markers are indicated in kb. 

Subsequently, the 600 bp fragment was used to screen an 
unamplified cDNA library prepared from the T-47D cell line. 
From a screen of 1 x 10 6 phage clones, two brk cDNAs were 
35 isolated, both of which contained inserts of approximately 2.5 
kb. 

Figure 2 shows a schematic representation of the human 
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brk cDNA showing selected restriction sites, the relative 
positions of the end of the cDNA clones (Xtl,2 and Xm3-5) and 
the positions of the 5' and 3' RACE-PCR generated fragments. 
The shaded region represents the open reading frame present 
in clone Xt2. Pi represents the 0.3 kb Pstl-Pstl fragment 
used in the RNase protection assay. 

Sequencing of the slightly longer clone Xt2 identified 
an open reading frame (ORF) of 1353 bp starting with a 
consensus sequence' for efficient translational initiation 
(Kozak, J. cell. Biol., 108, 229-241, (1989)) and terminating 
with a TGA codon at position 1392. The ORF was preceded by 
a 5' sequence of 28 bp that did not contain any in-frame stop 
codons . 

The possibility arose that the cDNA clone Xt2 did not 
contain the .complete brk coding region., In order, to identify 
potentially longer cDNAs, a radiolabeled restriction fragment 
corresponding to the 5' end of clone Xt2 was used as a probe 
to screen 1 x lo 6 phage clones , from a MCF-7 cell line 
unamplified cDNA library. From this screen three cDNAs (Xm 
3-5) were isolated (Figure 2), one of which (Xm4) had an 
additional 17 bp of 5' sequence. No initiation or stop codons 
in frame with the ORF were identified in this upstream 
sequence. m addition to these studies, 5' RACE PCR was 
carried out using nested oligonucleotide primers, 
corresponding to the 5' end of clone Xt2, on poly A* RNA 
isolated from T-47D and MCF-7 cell lines. Subcloning and 
sequencing of the amplified products indicated that no more 
upstream sequence had been isolated (Figure 2). Expression 
of clone Xt2, by recombinant baculovirus infection of Sf21 
cells, resulted in the synthesis of a protein whose mobility 
on SDS-polyacrylamide gels was consistent with the predicted 
molecular weight of the protein encoded by the ORF present. 

Figure 3 shows baculoviral and bacterial protein 
expression of brk.. Lysates from insect cells infected with 
wild type baculovirus (AcPNV) and a recombinant baculovirus 
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containing the brk cDNA Xt2 were subjected to SDS-PAGE (panel 
A) . Proteins were visualised by staining with Coomassie blue 
G-250. Arrows indicate the positions of the wild type 
polyhedrin protein and the recombinant brjc protein. The same 
5 insect cell lysates (panel B) and lysates from bacteria 
containing plasmids expressing GST or GST-brk fusion proteins 
(panel C) were subjected to SDS-PAGE' and transferred to nylon 
membranes. Phosphotyrosine containing proteins were "detected 
using the antibody 4G10. Duplicate membranes were incubated 

10 with only the detection reagents' to confirm that detection was 
-dependant on the— 4X310 antibody. The arrows indicate the 
positions of polyhedrin (panel B) and GST proteins (panel C) , 
as determined by Ponceau S staining of the membranes. The 
mobilities of the molecular weight markers * are indicated ,in 

15 kDa. 

In addition, in vitro translation in rabbit reticulocyte 
lysates using transcripts generated from clone Xt2 resulted 
in the synthesis of a single protein with the same mobility 
20 (data not shown) . These experiments indicated that the 
predicted start codon in the brk cDNAs could act as ,an 
initiation site for translation and, therefore, that clone Xt2 
most likely contained the complete brk coding region. 



25 



brjc encodes a novel putative non-receptor kinase 



30 



The 1353 bp open reading frame present in cDNA Xt2 
encodes a protein of 451 amino acids with a predicted 
molecular weight of 52 kDa. The complete nucleotide sequence 
and amino acid translation of Xt2 are shown in SEQ ID NO. 1. 
The amino acid sequence between positions 197-432 of brk has 
several motifs found in other PTK catalytic domains (Hanks et 
al.. Science, 241, 42-52 (1988)). The putative ATP-binding 
motif, GSGYFG, is at position 198-203, with a downstream 
35 lysine residue at 219. The putative autophosphorylation site 
in the catalytic domain is conserved at position 342. In brk 
the strong indicator sequence of PTK specificity in subdomain 
VI corresponds to DLAARN (position 312-317) . This motif is 
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shared with abl, f£s, csJs and bjDlcfatk.) , but not with the src 
subfamily genes (DLRAAN) . Amino acid sequences similar to SH2 
(78-169) and SN3 (15-68) domains are also present. The 
predicted amino terminal methionine of brk. is followed by 
valine and residue 7 is alanine. brjc therefore lacks the 
residues thought to be necessary for post-translational N- 
terminal myristoylation (Kaplan et al., Mol. Cell. Biol., fi, 
2435-2441 (1988) ) . The short carboxyl terminus following the 
tyrosine kinase domain of brk contains' a potential regulatory 
tyrosine (position 447) equivalent to tyrosine 527 of c-src r 
but it is found in a very different sequence context. The 
predicted amino-terminal region of brjg, beyond the putative 
SH3 domain, is very short and is equivalent in length to that 
of csk (Nada et al , Nature, 3J51, 69-72 (1991)'; Partanen et 
al., Oncogene, <5, 2013-2018 (1991); Brauninger et al., Gene, 
110, 205-211 (1992)). 

Expression of brk. in human tissues and breast tumour cell 
lines 

As mentioned previously, b^k. transcripts were not 
detected when Northern blots containing total RNA isolated 
from a variety of breast tumour cell lines were screened with 
the 600 bp brk. 3' RACE fragment. Two transcripts in the size 
range of 2.3 kb and 2.6-2.8 kb were detected on Northern blots 
containing poly A* mRNA isolated from the breast tumour cell 
lines MCF-7 and T-47D. No brk transcripts were detected on 
Northern blots containing poly A* mRNA isolated from normal 
human heart, placenta, brain, lung, liver, skeletal muscle, 
kidney and pancreas (Figure 1) . 

Using RNase protection and RT-PCR assays, we examined the 
expression of brk. in normal breast samples (reduction 
mammoplasty) and in grade III breast carcinomas, in the RNase 
protection assay, a 305 bp protected fragment of similar 
intensity to that detected in T-47D RNA was detected in one 
out of five grade III breast carcinoma RNAs but not in two 
normal breast samples. 
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Figure 4 shows detection of brk mRNA expression in normal 
breast and grade III breast carcinoma tissue by RNase 
protection. Total RNA samples were isolated from yeast, T-47D 
cell line, two different normal human breast samples and five 
different grade III breast carcinomas. RNAs were hybridised 
with a brk radiolabeled antisense RNA derived from the Pi 
cDNA fragment (Figure 2) or with an antisense GAPDH RNA probe. 
After RNase digestion and separation on a sequencing gel, 
products were detected by autoradiography for 4 days at -70 °C. 



Figure 5, shows detection of brk mRNA expression in normal 
breast and grade III breast carcinoma tissue by PCR. Specific 
primers were used to simultaneously amplify brk and GAPDH 
sequences present in cDNA synthesised from normal breast (N) 

15 or grade III breast, carcinoma (T) RNAs. Products were 
resolved on agarose gels, and the amplified GAPDH products 
were visualised with ethidium bromide (lower portion of each 
panel) . After transfer to nylon membranes, brk products were 
detected by probing with a specific radiolabeled cDNA and 

20 autoradiography (upper portion of, each panel). Lanes 
immediately to the left of the visualised GAPDH sequences 
contain the products of amplification from the corresponding 
non-reverse-transcribed RNAs. The upper panel corresponds to 
the same RNA samples as shown in Figure 4, excepting the left 

25 hand 'normal' sample in Figure 4 which is omitted here. The 
two lower panels represent an additional 4 normal and 7 tumour 
samples . 



By RT-PCR, a 600 bp reverse transcriptase-dependent brk 
product was specifically amplified from the tumour RNA which 
gave a positive signal in the RNase protection assay, when the 
same tumour and normal breast RNAs were analysed (Figure 5, 
upper panel) . Analysis of further normal breast and grade III 
breast carcinoma RNAs by RT-PCR detected another three out of 
seven tumours that expressed brk transcripts, whereas no brk 
products were amplified from four normal breast RNA samples 
(Figure 5, lower panels). 
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Baculoviral and bacterial expression of brk cDNA 

A recombinant baculovirus was generated containing the 
entire Xt2 cDNA under the control of the polyhedrin promoter. 
Infected insect cells expressed a novel protein with an SDS- 
PAGE mobility of approximately 50 kDa (Figure 3, A), 
consistent with the predicted molecular weight of 52 kDa/ A 
bacterial vector was constructed to express the entire brk 
open reading frame, except for the first two N-terminal amino 
acids, as a glutathione S-transf erase (GST) fusion protein. 
Cell lysates from bacterial cultures containing the GST-be^ 
vector expressed an approximately 70 kDa fusion protein. 

Western blots of both insect and bacterial cell lysates 
"using" an ant i -phbsphotyr os ine antibody demonstrated that the 
baculovirally expressed 50 kDa protein and the bacterially 
expressed 70 kDa fusion protein were the major immunoreactive 
polypeptides present (Figure 3) . No immunoreactive 

polypeptides were present in insect cell lysates infected with 
a wild type baculovirus and cell lysates from bacteria 
expressing GST alone (Figure 3). 

Expression ana function of brk cDNAs in mammalian cells 

N1H3T3 (clone b2B) cells were transfected either with 
pRcCMV or its derivative carrying the brk cDNA. The cells 
were selected with 1 mg/ml Genet icin, and clones of drug 
resistant cells were pooled. These populations were seeded 
at either lxlO 4 or 1x10 s per well of a six well dish in 0.4% 
agar, and left for six weeks. At the end of this period, 20- 
30 j«n colonies were observed with the cells containing the 
transfected brk cDNA, whereas none were observed with the 
cells transfected with the empty vector. In multiple seedings 
of lxlO 4 'brk' cells, the numbers of colonies obtained were 
44, 54 and 0; in the 1x1 0 5 seedings there were 81, 41, 50 and 
42 colonies. Of 55 of these colonies, 32 were successfully 
isolated and grown on, whereas none of the cell aggregates 
picked from the 'vector only' cells in agar were capable of 
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further proliferation* Seven of the expanded 'brk' clones 
were chosen, at random and placed back in soft agar; five of 
these produced 50 fxm colonies with 100% efficiency within 
seven days when seeded at lxlO 4 (lower efficiencies were 
observed at higher seeding densities) . These data indicate 
that feElc expression in N1H3T3 cells results in anchorage 
independent proliferation, a feature of neoplastically 
transformed cells. The clones, of brk transformed N1H3T3 cells 
will be useful tools for the study of brk function, and for 
screening potential brk inhibitors. 

HB4a f a conditionally immortalised human breast lumenal 
cell line (developed by Dr. M. O'Hare) , was transfected with 
pREP8 or its derivative containing the brk cDNA, and 
transfectants were selected in lmM histidinol. For each 
plasmid, 50-60 drug resistant colonies were pooled and 
maintained under selection. The cells were seeded at 1x10 s in 
duplicate in 0.4% agar and left for six weeks. The brk 
transfected cells produced an average of 373 50 irn clones, 
whereas the empty vector cells produced an average of 108, 
Thus, brk expression in these cells enhances their capacity 
to proliferate in an anchorage independent manner. 

An alternative brk cDNA 

Sequencing of brk cDNA clone Xm5 revealed a deletion 
corresponding to nucleotides 259 to 380 inclusive of SEQ ID 
NO. 1. The predicted protein product is identical to that in 
SEQ ID NO. 2 from the N-terminus up to and including residue 
77. Following the deletion, the nucleotide sequence continues 
to correspond to that shown in SEQ ID NO. 1 (nucleotides 381 
onwards) but with the reading frame changed so that the amino 
acid sequence diverges. The ensuing predicted amino acid 
sequence is shown in SEQ ID NO. 12 which is followed by a stop 
codon . 

This protein would, therefore, comprise an SH3 domain 
identical in sequence to that of brk , but lack the SH2 and 
catalytic domains. Since SH3 domains may mediate protein- 
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protein interactions, this protein may regulate brk function 
by competing for binding to specific protein partners. 



I 

! 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANTS 

(A) NAME: The Institute of Cancer Research 

(B) STREET: 17A f Onslow Gardens 

(C) CITY: London 

10 (E) COUNTRY: Great Britain ' 

<F) POSTAL CODE (ZIP): SW7 3AL 

(A) NAME: The Wellcome Foundation Limited 

( B ) STREET: Unicorn House, 160 Euston Road 
15 (C) CITY: London 

(E) COUNTRY: f Great Britain 

(F) POSTAL CODE (ZIP) NW1 2 BP 



2 0 (-ii-)— TITLE— OF— INVENTION: — Signal"ling - Mo'lecurle 

(iii) NUMBER OF SEQUENCES: 12 

(iv) COMPUTER READABLE FORM: ' 

(A) MEDIUM TYPE: Floppy disk 
25 (B) COMPUTER: IBM PC compatible — ^ 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2507 base pairs 

(B) TYPE: nucleic acid 
„ (C) STRANDEDNESS: double 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
40 (A) NAME /KEY: CDS 

(B) LOCATION: 29.. 1381 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: Is 

4 5 

CCTGGTCCTG CCGCTGCGCC CGCCCGCC ATG GTG TCC CGG GAC CAG GCT CAC 52 

Met Val Ser Arg Asp Gin Ala His 
1 5 

50 CTG GGC CCC AAG TAT GTG GGC CTC TGG GAC TTC AAG TCC CGG ACG GAC 100 
Leu Gly Pro Lys Tyr Val Gly Leu Trp Asp Phe Lys Ser Arg Thr Asp 
10 15 20 

cc G ^ G CTG AGC 1X0 CGC GCG GGG GAC GTC TTC CAC GTG GCC AGG AAG 148 

55 Glu Glu Leu Ser Phe Arg Ala Gly Asp Val Phe His Val Ala Arg Lys 
25 30 35 40 

GAG GAG CAG TGG TGG TGG GCC ACG CTG CTC GAC GAG GCG GGT GGG GCC 196 
Glu Glu Gin Trp Trp Trp Ala Thr Leu Leu Asp Glu Ala Gly Gly Ala 
45 50 55 

GTG GCC CAG GGC TAT GTG CCC CAC AAC TAC CTG GCC GAG AGG GAG ACG 244 
Val Ala Gin Gly Tyr Val Pro His Asn Tyr Leu Ala Glu Arg Glu Thr 
60 65 70 

GTG GAG TCG GAA CCG TGG TTC TTT GGC TGC ATC TCC CGC TCG GAA GCT 292 
Val Glu Ser Glu Pro Trp Phe Phe Gly Cys He Ser Arg Ser Glu Ala 
75 80 85 
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30 



10 



20 



30 



50 



60 



GTG CGT CGG CTG CAG GCC GAG GGC AAC GCC ACG GGC GCC TTC CTG ATC 340 
Val Arg Arg Leu Gin Ala Glu Gly Asn Ala Thr Gly Ala Phe Leu lie 
90 95 100 



AGC GTC AGC GAG AAG CCG AGT GCC GAC TAC GTC CTG TCG GTG CGG GAC 388 
Arg Val Ser Glu Lys Pro Ser Ala Asp Tyr Val Leu Ser Val Arg Asp 

105 no us ' 12 5 

^ G £AG GCT GTG CGG CAC TAC AAG ATC TGG CGG CGT GCC GGG GGC CGG 436 
Thr Gin Ala Val Arg His Tyr Lys He Trp Arg Arg Ala Gly Gly Arg 
I 25 130 135 

CTG CAC CTG AAC GAG GCG GTC TCC TTC CTC AGC CTG CCC GAG CTT GTG 484 
Leu His Leu Asn Glu Ala Val Ser Phe Leu Ser Leu Pro Glu Leu Val 
Ai> i4 ° 145 150 

1 

AAC TAC CAC AGG GCC CAG AGC CTG TCC CAC GGC CTG CGG CTG GCC GCG 532 
Asn Tyr His Arg Ala Gin Ser Leu Ser His Gly Leu Arg Leu Ala Ala 
I 55 160 165 

CCC TGC CGG AAG CAC GAG CCT GAG CCC CTG CCC CAT TGG GAT GAC TGG 580 
Pro Cys Arg Lys His Glu Pro Glu Pro Leu Pro His Trp Asp Asp Trp 

I 

25 GAG AGG CCG AGG GAG GAG TTC ACG CTC TGC AGG AAG CTG GGG TCC GGC 628 
Glu Arg Pro Arg Glu Glu Phe Thr Leu Cys Arg Lys Leu Gly Ser Gly 
185 190 195 200 



TAC TTT GGG GAG GTC TTC GAG GGG CTC TGG AAA GAC CGG GTC CAG GTG ' 676 
Tyr Phe Gly Glu Val Phe Glu Gly Leu Trp Lys Asp Arg Val Gin Val 
205 210 215 

GCC ATT AAG GTG ATT TCT CGA GAC AAC CTC CTG CAC CAG CAG ATG CTG 724 
Ala lie Lys Val lie Ser Arg Asp Asn Leu Leu His Gin Gin Met Leu 
J5 220 225 230 

CAG TCG GAG ATC CAG GCC ATG AAG AAG CTG CGG CAC AAA CAC ATC CTG 772 
Gin Ser Glu lie Gin Ala Met Lys Lys Leu Arg His Lys His He Leu 
4 0 235 240 245 

GCG CTG TAC GCC GTG GTG TCC GTG GGG GAC CCC GTG TAC ATC ATC ACG 820 
Ala Leu Tyr Ala Val Val Ser Val Gly Asp Pro Val Tyr He He Thr 
250 255 260 

45 GAG CTC ATG GCC AAG GGC AGC CTG CTG GAG CTG CTC OGC GAC TCT GAT 868 
Glu Leu Met Ala Lys Gly Ser Leu Leu Glu Leu Leu Arg Asp Ser Asp 
265 270 275 280 

GAG AAA GTC CTG CCC GTT TCG GAG CTG CTG GAC ATC GCC TGG CAG GTG 916 
Glu Lys Val Leu Pro Val Ser Glu Leu Leu Asp He Ala Trp Gin Val 
285 290 295 



w TG TAC CTC GAG TCG CAG TAC ATC CAC CGG GAC 964 

" " " His 

310 



Ala Glu Gly Met Cys Tyr Leu Glu Ser Gin Asn Tyr He His Arq Asp 
00 300 305 



CTG GCC GCC AGG AAC ATC CTC GTC GGG GAA AAC ACC CTC TGC AAA GTT 1012 
Leu Ala Ala Arg Asn He Leu Val Gly Glu Asn Thr Leu Cys Lys Val 
315 320 325 

GGG GAC TTC GGG TTA GCC AGG CTT ATC AAG GAG GAC GTC TAC CTC TCC 1060 
Gly Asp Phe Gly Leu Ala Arg Leu He Lys Glu Asp Val Tyr Leu Ser 
330 335 34 5 

65 CAT GAC CAC AAT ATC CCC TAC AAG TGG ACG GCC CCT GAA GCG CTC TCC 1108 
His Asp His Asn He Pro Tyr Lys Trp Thr Ala Pro Glu Ala Leu Ser 
345 350 355 360 
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CGA GGC CAT TAC TCC ACC AAA TCC GAC GTC TGG TCC TTT GGG ATT CTC 
Arg Gly His Tyr Ser Thr Lys Ser Asp Val Trp Ser Phe Gly lie Leu 
365 370 t 375 

CTG CAT GAG ATG TTC AGC AGG GGT CAG GTG CCC TAC CCA GGC ATG TCC 
Leu His Glu Met Phe Ser Arg Gly Gin Val Pro Tyr Pro Gly Met Ser 
380 385 2S0 

AAC CAT GAG GCC TTC CTG AGG GTG GAC GCC GGC TAC CGC ATG CCC TGC 
Asn His Glu Ala Phe Leu Arg Val Asp Ala Gly Tyr Arg Met Pro Cys 
395 400 405 

CCT CTG GAG TGC CCG CCC AGC GTG CAC AAG CTG ATG CTG ACA TGC TGG 
Pro Leu Glu Cys Pro Pro Ser Val His Lys Leu Met Leu Thr Cys Trp 
410 415 i 420 

TGC AGG GAC CCC GAG CAG AGA CCC TGC TTC AAG GCC CTG CGG GAG AGG 
Cys Arg Asp Pro Glu Gin Arg Pro Cys Phe Lys Ala Leu Arg Glu Arg 
425 430 435 440 

CTC TCC AGC TTC ACC AGC TAC GAG AAC CCG ACC TGAG CTG CTG TGGAGCGGGC 
Leu Ser Ser Phe Thr Ser Tyr Glu Asn Pro Thr 
445 450 

25 ATGGCCGGGC CCTGCTGAGG AGGGGCCTGG GCAGAGfcGCC TGGACCTGGCTATCAAGGCCC 
ACGCGCTTCC CTGGGGTTTA CTGAGGTGAT GGGTGCAGGA AAGGTTCACA AATGTGGAGT 
GTCTGCGTCC AATACACGCG TGTGCTCCTC TCCTTACTCC ATCG TGTGTG CCTTGGGTCT 
CAG CTG CTG A CACGCAGCCT GCTCTGGAGC CTG CAG ATG A GATCCGGGAG ACTGACACGA 
AGCCAGCAGA GGTCAGAGGG GACTCTGACC ACAGCCCGCT CTCTGGCTGT CTGTCTG CAG 



10 



15 



20 



30 



35 



40 



45 



50 



55 



60 



TGCCCGGCTG AGGGTGGGAG GCAAACACGC CTTGTTCCTG CTCTTCCCAG TTCAGCTTGG 

t 

TGGGAGAAAG TCATTCGCGT GGCTCGGGAC GCTCATGTAA ATTTGGTTTT GGTGCTCAAG 
GGTTCTTTCC TCCCAGGGGC AGGTGTTTCT TTCCTGTTTG TCTTGTGTCT TGAGAGCTTG 
G CCTTATG AC CAGTGAGAAC TCTCTCCCTG GTCTCTGCCA GCCCAAGCAT CACTGCCCGA 
GGCGCCAGCT CAGTTTCACC GTCCACGTCC ACAAGGGGCT TTTCCCACCT TCACCTTTGT 
CGCTGGGTCA GTGCTGGAAA GCGCCCCTCA CTCCTGCGCT GACAAGGGCC CTTCTCTACT 
GTCTGTGGGG TGGTTCCGGG CTGGGGGGGC TGCCTCCTTT GCACCTGATT TTGAAGGTGT 
CTCTTTCATC CATGG TTAAG TCATAAAAAG CTTATTGGTT TTGG TTTTG A CTCACCTGAA 
AGTTTTTTTG GTTTAAAAGA AGAATAGGCG GGGCACGGTG GCTCGTGCCT GTAATCCCAG 
CACTTTGGGA GGCTGAGGCA GGTGGATCAC GAGGTCAGGA GATCGACACC ATCCTGGCTA 
ACACGGTGAA GCCCCGTCTC TACTAAAAAA TACAAAAAAT TAGCTGGGTG TGGTGGTGGG 
GGTGGGCGCC TGTGGTCCCA GCTACGTGGG AGGCTGAGGC AGCAGACTGG TGTGAACCCG 
GGAGGTGGAG CTTGCAGTGA G CCG AGGTCG CGCCACTGCA CTCCAGCCTG GGCGACAGAG 
CGAGACTCCA TCTCAAAAAA AAAAAA 



1156 

1204 

1252 

1300 

1348 

1401 

1461 

1521 

1581 

1641 

1701 

1761 

1821 

1881 

1941 

2001 

2061 

2121 

2181 

2241 

2301 

2361 

2421 

2481 

2507 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(I) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 451 amino acids 
5 (B) TYPEs amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID, NO: 2: 

Met Val Ser Arg Asp Gin Ala His Leu Gly Prq Lys Tyr Val Gly Leu 
1 5 10 15 

15 Trp Asp Phe Lys Ser Arg Thr Asp Glu Glu Leu Ser Phe Arg Ala Gly 

20 25 30 



20 



30 



35 



45 



50 



60 



65 



Asp Val Phe His Val Ala Arg Lys Glu Glu Gin Trp Trp Trp Ala Thr 
35 40 45 

Leu Leu Asp Glu Ala Gly Gly Ala Val Ala Gin Gly Tyr Val Pro His 
50 55 60 

Asn Tyr Leu Ala Glu Arg Glu Thr Val Glu Ser Glu Pro Trp Phe Phe 
25 65 70 75 80 

Gly Cys He Ser Arg Ser Glu Ala Val Arg Arg Leu Gin Ala Glu Gly 
85 90 95 

Asn Ala Thr Gly Ala Phe Leu He Arg Val Ser Glu Lys Pro Ser Ala 
100 105 110 

Asp Tyr Val Leu Ser Val Arg Asp Thr Gin Ala Val Arg His Tyr Lys 
115 120 125 

IlG ** g Arg Ala G1 y G1 y ^9 Leu His Le « Asn Glu Ala Val Ser 

I 30 135 , f 140 

ao L6U Ser LeU Pro Glu Leu Val Asn His Ala Gin Ser Leu 

* U 145 150 155 160 

Ser His Gly Leu Arg Leu Ala Ala Pro Cys Arg Lys His Glu Pro Glu 
165 170 175 

Pro Leu Pro His Trp Asp Asp Trp Glu Arg Pro Arg Glu Glu Phe Thr 
180 185 



190 



Leu Cys Arg Lys Leu Gly Ser Gly Tyr Phe Gly Glu Val Phe Glu Gly 
195 200 205 

Leu Trp Lys Asp Arg Val Gin Val Ala He Lys Val He Ser Arg Asp 
210 215 220 

cc £5 n Leu Leu His Gln Gln Met Leu Gin Ser Glu He Gin Ala Met Lys 
55 225 230 235 240 

Lys Leu Arg His Lys His He Leu Ala Leu Tyr Ala Val Val Ser Val 
245 250 255 

Gly Asp Pro Val Tyr He He Thr Glu Leu Met Ala Lys Gly Ser Leu 
260 265 270 

Leu Glu Leu Leu Arg Asp Ser Asp Glu Lys Val Leu Pro Val Ser Glu 
275 280 285 

Leu Leu Asp He Ala Trp Gln Val Ala Glu Gly Met Cys Tyr Leu Glu 
290 295 300 
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10 



25 



30 



Ser Gin Asn Tyr Ile Hi „ 8p „ eu M Ala Arg ftsn ^ ^ 

310 320 



33 

ls Arg Asp Leu Ala Ala Arg Asn lie 
10 315 

Gly Glu Asn Thr Leu Cy S Lys Val Gly Asp Phe Gly Leu Ala Arg Leu 
He Lys Glu Asp Val Tyr Leu Ser H^s Asp His Asn He Pro Tyr Lys 



Trp Thr Ala Pro Glu Ala Leu Ser Arg Gly His Tyr Ser Thr Lys Ser 

JbO 365 
Asp Val Trp ser Phe Gly lie L eu Leu His Glu Met Phe Ser Arg Gly 
15 375 380 

Gin Val Pro Tyr Pro Gly Met Ser Asn His Glu Ala Phe Leu Arg Val 

395 400 
20 ASP A1S Gly *»* J r | Met P " <** Pro Leu Glu Cys Pro Pro Ser Val 

His Lys Leu Met Leu Thr Cys Trp Cys Arg Asp Pro Glu Gin Arg Pro 



50 



430' 



Cys Phe Lys Ala Leu Arg Glu Arg Leu Ser Ser Phe Thr Ser Tyr Glu 



440 



445 



Asn Pro Thr 
450 



(2) INFORMATION FOR SEQ ID NO: 3: 

35 <*> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 W TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
45 GGAATTCTAG AMGSGACYTG GCVGCBAGRA AC 



(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
cc ( c > STRANDEDNESS: single 
33 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

60 < xi > SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGAATTCTAG ACACSCCRWA RSWCCASACR TC 



65 



32 



32 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids . ' 

5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID ,NO: 5: v 

His Arg Asp Leu Ala Ala Arg ABn i 
1 5 



15 



25 



45 



i 

(2) INFORMATION FOR SEQ, ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 5 

30 (D) pTHER INFORMATION: /note= "Xaa at position 5 is Tyr or 

Phe" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
35 (B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "Xaa at position 7 is Val or 
lie- i ' 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Val Trp Ser Xaa Gly Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GACTCGAGTC GACATCGATT TTTTTTTTTT TTTTT 35 

60 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: , f 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCTCGTCGGG GAAAACACC 

(2) INFORMATION FOR SEQ ID NO: 9: i 

on (i) SEQUENCE CHARACTERISTICS: 

20 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGCTTATCAA GGAGGACGTC 

30 

(2) INFORMATION FOR SEQ ID NO: 10: 

35 (i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 17 base pairs t 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
45 CTGGCCACGT GGAAGAC 



50 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
__ (C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 



60 



65 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACTTGAAGTC CCAGAGG 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ala Gly His Ala Gly Cys Ala Ala Leu Gin Asp Leu Ala Ala Cys Arg 
1 5 10 J.5 

15 Gl y Pro Ala Ala Pro Glu Arg Gly Gly Val Leu Pro Gin Pro Ala Arg 

20 .25 30 



20 



Ala Cys Glu Leu Pro Gin Gly Pro Glu Pro Val Pro Arg Pro Ala Ala 
35 40 45 

Gly Arg Ala Leu Pro Glu Ala Arg Ala 
50 55 
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CLAIMS: 



1. A DNA isolate encoding a protein tyrosine kinase or a 
fragment thereof, the protein tyrosine kinase being 
5 characterised by the amino acid sequence shown in 
SEQ ID NO. 2 

or by an amino acid sequence showing a significant degree of 
homology thereto . 

10 2. A DNA isolate as claimed in Claim l wherein the amino 
acid- sequence-shows -at -ieast 60% homology in the catalytic 
domain and at least 40% homology in the remainder of the 
molecule to the sequence shown in SEQ ID NO. 2: 



15 3. A DNA isolate as claimed in Claim 1 wherein the amino 
acid sequence shows at least 90% homology to the sequence 
shown in SEQ ID NO. 2. 



4 . A DNA isolate as claimed in any of Claims 1 to 3 encoding 
the full amino acid sequence of the protein tyrosine kinase. 

5. A DNA isolate as claimed in any of Claims 1 to 3 encoding 
all or part of the catalytic domain of the protein tyrosine 
kinase. 

6. A DNA isolate as claimed in any of claims l to 4 wherein 
the DNA encoding the protein tyrosine kinase or a fragment 
thereof has the DNA sequence shown in SEQ ID NO. 1. 

30 7. a DNA isolate as claimed in any of claims 1 to 3 encoding 
amino acids 1 to 77 of 

SEQ ID NO. 2 
followed by the amino acids of 

SEQ ID NO. 12 

or an amino acid sequence showing a significant degree of 
homology thereto. 

8. A recombinant expression vector comprising a DNA as 
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10 



15 



20 



claimed in any of Claims l to 7 under control of an 
appropriate promoter and regulatory elements for expression 
in a host cell. 

9. A recombinant cell line transformed with an expression 
vector as claimed in Claim 8 , and capable of expressing a 
recombinant protein tyrosine kinase or fragment thereof 
characterised by the amino acid sequence shown in " 

SEQ ID NO. 2, 

or an amino acid sequence showing a significant degree of 
homology thereto. 

p 

10. A recombinant cell line as claimed in claim 9 which is 
a CHO, myeloma, primary' immortalised breast, rodent 
fibroblast, baculovirus, yeast or E. coli cell line. 

11. A protein tyrosine kinase or a fragment thereof 
characterised by the amino acid sequence shown in 

SEQ ID NO. 2 

or an amino acid sequence showing a significant degree of 
homology thereto. . 

i 

12. A protein tyrosine kinase or a fragment thereof as 
claimed in Claim 11 which is a recombinant polypeptide. 

13. A protein tyrosine kinase or a fragment thereof as 
claimed in Claim 11 which has been produced in a human tumour 
cell line. 

30 14. a protein tyrosine kinase or a fragment thereof having 
the amino acid sequence of amino acids 1 to 77 of 

SEQ ID NO. 2 
followed by the amino acids of 
SEQ ID NO. 12 

35 or an amino acid sequence showing a significant degree of 
homology thereto. 

15. A method of screening a substance for potential utility 
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as a therapeutic agent in the treatment of cancer which 
comprises providing a standard system in which a protein 
tyrosine kinase or an active fragment thereof is able to 
develop a measurable effect, allowing the protein tyrosine 
5 kinase to develop that effect in the presence and absence of 
the said substance and measuring that effect, ability to 
produce significant inhibition of trie effect being taken as 
an indication of potential utility as a therapeutic agent, 
wherein the protein tyrosine kinase is characterised by the 
10 amino acid sequence' shown in ' 
-SBQ ID-NQ. _2— 

or an amino acid sequence showing a significant degree of 
homology thereto. 

15 16. A method as claimed in Claim 15 wherein the protein 
tyrosine kinase or an active fragment thereof is derived from 
a recombinant expression system or a cellular source in which 
the protein is over-expressed. 

20 17. a method as claimed in Claim 16 wherein the recombinant 
expression system is an insect baculovirus, an immortalized 
human breast cell line or a rodent firoblast cell line. 



25 



30 



35 



18. A method as claimed in any of claims 15 to 17 wherein the 
measurable effect is protein tyrosine kinase activity which 
is measured by measuring autophosphorylation or 
Phosphorylation of a model substrate, by following 
incorporation of radioactive phosphate or by using 
anti-phosphotyrosine antibodies. 

19. A method as claimed in Claim 15 wherein the measurable 
effect is a phenotypic alteration conferred by expression of 
the protein tyrosine kinase or an active fragment thereof in 
a recombinant or non-recombinant cell line. 

20. A method as claimed in Claim 15 wherein the measurable 
effect is the tumorogenic effect of a recombinant or non- 
recombinant cell line expressing the protein tyrosine kinase 
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Hv of ^! WeUtiC a ' ent "entifiea by . method as claimed in 

any of Claxms 15 to 21 ■ Ln 

10 • ' 

23. A therapeutic agent as claimed in Claim 22 wh^h • 

chemical molecule of relative i« ? 15 a 

or relatively low molecular weight. 

15 It' A therap6Utic a * ent »■ claimed in Claim 23 which is a 
15 staurosporine analogue, a tyrphostin or a flavenoit 

pe'ptide." 116 ^ 10 ^ ° laiffled in Clai » which is a 

20 -III- A therapeUtic a * ent «■ claimed in claim 22 which is an 

SEQ ID NO. 2 

wltn?^," 1 ^ 0 ' WMCh C ° mPriSeS a test sample 

«t* * spec lfl c antibody raised against an antigen fro- Le 
saxd ». aold seance and determining whether there !" »y 
antigen antibody binding within the test sample. 

s 2 e m en=r h ° a , f0r deteCtl °" in • «* a nucleic acid 

«r s a"le 9 3 tyr ° Sine klnaSS " hlch subjecting 

»!.k, * " UCXei0 aCid isolated herefrom to a methol 

oTs^irirn" 9 a nuciei ° ac " se - ence — in 

and ^dentL h0 "" 0l09 ° US t0 * -=^ic acid sequence 

h^l, yin3 ^ SK5UenCe lden ««l or substantially 

homologous to all or part of the DNA sequence defined in 
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SEQ ID NO. 1, 

DNA sequence from another part(.s) of the tyrosine kinase 
characterised by that sequence or mRNA which would be the 
transcription product of such DNA. 

29. A method as claimed in Cla'im 28 wherein DNA is detected 
by. Southern blotting and/ or PGR using appropriate primers. 



30. a method as claimed in Claim 28 wherein RNA is detected 
by Northern blotting, RNAse protection studies and direct 
nucleic acid, in situ hybridisation. 

31. An enzyme substrate complex which comprises a protein 
tyrosine kinase characterised by the amino acdd sequence shown 
in 

SEQ ID NO. 2 

or by an amino acid sequence showing a significant degree of 
homology thereto and a therapeutic agent capable of modulating 
the activity of the said protein tyrosine kinase. 



32. 



A complex as claimed in Claim 30 wherein the therapeutic 
agent is a chemical molecule of relatively low molecular 
weight. 



33. A complex as claimed in Claim 32 wherein the therapeutic 
agent is a staurosporine analogue, a tyrphostin or a 
f lavenoid. 

34. A complex as claimed in Claim 30 wherein the therapeutic 
agent is a macromolecule. 

35. a complex as claimed in Claim 34 wherein the therapeutic 
agent is an antibody. 

36. a complex as claimed in Claim 30 wherein the therapeutic 
agent as a peptide. 

37. A complex as claimed in Claim 30 wherein the therapeutic 
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agent is an antisense oligonucleotide capable of blocking 
translation of the xorna encoding the protein tyrosine kinase 
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